A comprehensive characterization of recurrences in time series 
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Study of recurrences in earthquakes, climate, financial time-series, etc. is crucial to better forecast 
disasters and limit their consequences. However, almost all the previous phenomenological studies 
involved only a long-ranged autocorrelation function, or disregarded the multi-scaling properties in- 
duced by potential higher order dependencies. Consequently, they missed the facts that non-linear 
dependences do impact both the statistics and dynamics of recurrence times, and that scaling argu- 
ments for the unconditional distribution may not be applicable. We argue that copulas is the correct 
model-free framework to study non-linear dependencies in time series and related concepts like re- 
currences. Fitting and/or simulating the intertemporal distribution of recurrence intervals is very 
much system specific, and cannot actually benefit from universal features, in contrast to the previ- 
ous claims. This has important implications in epilepsy prognosis and financial risk management 
applications. 
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Extreme events are widely studied in seismicity, as- 
tronomy, physiology, finance, etc. [l| In particuliar, the 
inter-occurence times (or recurrence intervals), i.e. the 
periods between two occurences of the observed phe- 
nomenon that exceed a given threshold, have important 
implications in risk management in a view to predict the 
advent of such extreme events or characterize aftershocks. 

A consensus has emerged on the statistics of recur- 
rence times from many phenomenological studies on real 
or simulated data with long-ranged correlations: the un- 
conditional probability distribution function (PDF) of 
recurrence intervals r follows an (approximate) scaling 
relation [A- 



(1) 



where r c is a characteristic recurrence time for a given 
time series and a choice of event-triggering threshold. 
Furthermore, the sequence of recurrence times exhibits 
long-term correlation 0, [E| . 

Yet we show that this consensus, founded on the mis- 
conception that "nonlincaritics arc not needed to explain 
the properties studied" Q, is incorrrect since the whole 
non-linear dependences affect 7r(r). Some ad hoc at- 
tempts at bringing in non-linearities and/or multi-scaling 
have also been made, in view of modelling the behavior of 
a specific system (e.g. financial returns with multifractal 
log- volatility @). In this paper we propose that copulas 
is the correct model-free theoretical framework to study 
non-linear dependencies in time series and related con- 
cepts like recurrences, and stress the fact that non-linear 
correlations and multi-point dependences are relevant. In 
passing, a characterization of clustering based on the au- 
tocorrelation of recurrence intervals is an oversimplified 
view of the reality. We also argue that a scaling rela- 
tion of the form ([1]) is at best approximate, and would 
hold for processes exhibiting a time-dependence charac- 
terized by a unique time scale r c . More importantly, no 



such simple scaling is expected either in the general case 
of non-linear dependences, or when several characteristic 
time scales are involved in the dependence structure. 

We consider a time series of length T, that can 
be seen as one realization of a corresponding discrete 
stochastic process {X t } t= i...T- The joint cumulative dis- 
tribution function (CDF) of n occurrences (1 < t\ < 
. . . < t n < T) of the process is 



t (x) =V[X tl <x tl ,...,X tn <x tn \. (2) 



We assume that the generating process {X t } of the se- 
ries has a strong-stationary marginal CDF F, and let 
the joint distribution T have long-ranged dependences, 
as is typically the case e.g. for seismic and financial data. 
Strictly speaking, most of the discussion that follows only 
relics on time-homogeneity of the copula, i.e. transla- 
tional invariance of the dependence structure, and does 
not need stationarity of the marginal. However, in this 
class of problems much related to empirical data, it is 
more convenient (although sometimes not justified) to 
assume at least a weak form of stationarity, in order to 
give sense to the notion of empirical CDF. In the follow- 
ing, we will always assume stationarity of the marginal 
and homogeneity of the copula. 

A realization of X t at date t will be called an "event" 
when its value exceeds a threshold X {+) . In the fol- 
lowing we will consider a threshold defined as the up- 
per p + -quantile of the marginal distribution: X (+) = 
F- 1 {l-p+). 

Although the n-points expectations of Gaussian pro- 
cesses reduce to all combinations of 2-points expectations 

(X t X t ,)=p(\t-t'\), 

their full dependence structure is not reducible to the bi- 
variatc distribution, unless the process is also Markovian 
(i.e. only in the particular case of exponential correla- 
tion). Furthermore, when the process is not Gaussian, 
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even the multi-linear correlations are irreducible. In the 
general case, the whole multivariate CDF T is needed 
but, as we show just below, recurrences only involve the 
diagonal n-points copula: 

C n (p) = Ft+UMiF- 1 ^), . . .,F-\p)), 

which measures the joint probability that all n > 1 con- 
secutive variables X t +i, . . . , X t + n are below the upper 
p-th quantile of the stationnary distribution (p £ [0, 1], 
and t+ [l,n] is a shorthand for {t+1, . . . , t+n}). Clearly, 
C\ (p) — p and we set by convention Cq (p) = 1 . 
As an example, the Gaussian diagonal copula is 

C„(p) = $ p (<D- 1 (p),...,<I)- 1 (p)) (3) 

where <£> -1 is the univariate inverse CDF, and $ p denotes 
the multivariate CDF with (n x n) covariance matrix p, 
which is Toeplitz with symmetric entries 

Ptt> = p(\t - t'\), t,t' = 1, ...,n. 

The White Noise product copula C n (p) — p n is recov- 
ered in the limit of vanishing correlations p(£) = 0W, 
and other examples include the exponentially correlated 
Markovian Gaussian Noise, the power-law correlated 
(thus scale-free) Fractional Gaussian Noise, and the log- 
arithmically correlated multifractal Gaussian Noise. 

Empirically, the n-points probabilities are very hard 
to measure due to the large noise associated with such 
rare joint occurences. However, there exist observables 
that embed many-points properties and are more easily 
measured, such as the length of sequences (clusters) of 
thresholded events Q, and the recurrence times of such 
events, that we study here. 

The probability 7r(r) of observing a recurrence inter- 
val r between two events is the conditional probability of 
observing a sequence of r — 1 "non-events" bordered by 
two events: 

tt(t) = P[A T >A< + \A [1;r [<X (+ >|Ao>X (+) ]. 

After a simple algebraic transformation flipping all '>' 
signs to '<', it can be written in the language of copulas 
as: 

, x C r -i(l- P+ )-2C r (l-p + )+C T+ i(l-p+) . 
tt(t) = — . (4) 

The cumulative distribution 

H(r) = Y Tr(n) = 1 - Cr(l- P+ )-C T+1 (l-P + ) 

is more appropriate for empirical purposes, being less 
sensitive to noise. These exact expressions make clear — 
almost straight from the definition — that (i) the distri- 
bution of recurrence times depends only on the copula of 
the underlying process and not on the stationary law, in 



particular its domain or its tails (this is because we take a 
relative definition of the threshold as a quantile); (ii) non- 
linear dependences are highly relevant in the statistics of 
recurrences, so that linear correlations can in the gen- 
eral case by no means explain alone the properties of 
7t(t) Q; and (iii) recurrence intervals have a long mem- 
ory revealed by the (r + l)-points copula being involved, 
so that only when the underlying process X t is Marko- 
vian will the recurrences themselves be memoryless.0 
Hence, when the copula is known (Eq. <j3j> for Gaussian 
processes), the distribution of recurrence times is exactly 
characterized by the analytical expression in Eq. (|4]). 

The average recurrence time is found straightforwardly 
by summing the series 

oo 

(r)=J2rn(T) = — , (5) 

and is universal whatever the dependence structure. This 
result was first stated and proven by Kac [l(| , in a similar 
fashion. It is intuitive as, for a given threshold, the whole 
time series is the succession of a fixed number p+T of 
recurrences whose lenghts r, necessarily add up to the 
total size T, so that (r) = Y< 1 t i/(p+ t ) = 1 /p+- Notc 
that Eq. (0 assumes an infinite range for the possible lags 
r, which is achieved either by having an infinitely long 
time series, or more practically when the translational- 
invariant copula is periodic at the boundaries of the time 
series, as is typically the case for artificial data which are 
simulated using numerical Fourier Transform methods. 
Introducing the copula allows to emphasize the validity 
of the statement even in the presence of non-linear long- 
term dependences, as Eq. ([S]) means that the average 
recurrence interval is copula- independent. 

The variance of the distribution, cr 2 = (r 2 ) — (t) 2 can 
be computed as well by summing r 2 7r(r) over r: 

P+ P+ 

It is not universal, in contrast with the mean, and can 
be related to the average unconditional waiting time 0] ■ 
The universality of the average recurrence time has 
important implications for the potential scaling proper- 
ties of the PDF of recurrence times. Indeed, it has been 
believed that 7r(r) can be fitted by a unique scaling func- 
tion Eq. (JTJ) , with / depending on the underlying process 
and its dependence structure. Such a scaling would be of 
paramount importance in the empirical investigation of 
extreme events (for which p + is close to and there is of- 
ten too few data points to conduct a thorough statistical 
study), as it would make it possible to extrapolate the 
distribution found at low thresholds to large ones. Now, 
because there is a one-to-one correspondance between (t) 
and p+ following the universality ([5]) , the natural scale r c 
is necessarily the mean recurrence times, and the relation 
(TTJ) describes in fact a scaling of it with the threshold. 
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Possible and reported scaling functions include: expo- 
nential decay ln/(x) ~ x (which corresponds to indepen- 
dent arrivals), power-law decay ln/(x) ~ In a:, stretched 
exponential (generalized Gamma) In In f(x) ~ lnx, and 
others, e.g. mixed stretched exponential + power-law, 
possibly due to finite size and discreteness effects 0, 0] . 

On the theoretical side, the fact that the distribution 
of recurrence times exhibits a universal scaling ranging 
over several orders of magnitude has been shown to be 
related to criticality in long-ranged correlated complex 
systems, like invariance under Rcnormalization- Group 
transformation ll|. But such a universal scaling can 
only exist for linearly correlated processes Q , and there 



is even a concern [12| whether there exist at all fixed- 
point solutions of the RG equations other than the trivial 
exponential function corresponding to independent ar- 
rivals. Therefore, since the average recurrence time does 
not carry any information whatsoever about the depen- 
dence structure of the process, a scaling relation of the 
form (|Tj) is a trivial one when r c = (r) , and it might hold 
for processes exhibiting a time-dependence fully char- 
acterized by linear correlations, and no other relevant 
time scale; this is the case only for the power-law corre- 
lated Gaussian process [l3j], for which 7r(r) can anyways 
be expressed analytically according to Eqs. (|3|4|) with 
p(i) ~ £~ 7 . But no such simple scaling is expected in 
the general case when either non-linear dependences are 
present, and/or when several time scales are involved in 
the dependence structure. 



We illustrate this on Fig. 1 (a) for the daily log- returns 
of the IBM stock from 1962 to 2010 (same data as in 
14 1, with however a sign flip since our definition of the 
threshold is upside whereas there it is downside). Fi- 

I 



nancial returns have a very short term linear correla- 
tion, a very long term and multifractal quadratic correla- 
tion, and other kinds of intermediate-scale dependences 
(e.g. leverage effect) [Hj], that show up in the copula in 



a very peculiar fashion [16[. Hence, as expected [1 



the distribution of recurrence times has a more complex 
functional form than what is allowed by the scaling in 
Eq. ([T]), as reproduced on the left panel. One may note 
by the way that the correlation of the amplitudes (volatil- 
ity clustering) is the dominant one, so that the series of 
u>t = log |A t is mostly linearly correlated and in fact 
the recurrence intervals distribution of w+does exhibit 
an approximate scaling of the type ([T]) 0, [13] ■ 

We also illustrate the non-scaling behavior on measure- 
ments of electroencephalograms (EEG) on brain surface 
of patients awake with eyes open (set A of the dataset 
studied in Ref. [HI). It consists of 100 stationary series 
of 4097 observations each; once centered and rescaled, all 
series have same distribution. The results are shown on 



Fig. 1(b) the presence of several (in fact periodic) scales 
in the dependence structure forbids the scaling of 7r(r) 
with (t). 

The dynamics of recurrence times is as important 
as their statistical properties, and in fact impacts the 
empirical determination of the latter fl9i | . It is now clear, 
both from empirical evidences and analytically from the 
discussion on Eq. (QJ, that recurrence intervals have a 
long memory. In dynamic terms, this means that their 
occurences show some clustering. The natural question 
is then: "Conditionally on an observed recurrence time, 
what is the probability distribution of the next one?" 
This probability of observing an interval r' immediately 
following an observed recurrence time r is 



V{X T+T , > x { +\x T+ll . Tll < x^\x T > x^\x {1 . Tl < X {+ \X a > A< +) ]. 



(7) 



Again, flipping the '>' to '<' allows to decomposed it as 

C t -1;t'-1 — Ct;t'-1 ~ Ct-1;t' + C T - T ' _ 7t(t + t') 
C t -1 — 2C r + C r +1 7t(t ) 

where the (T+r')-points probability 

Cr;r'(p) = J"[ 0; r+r']\{r } (F~ 1 (p) , ■ ■ -^^(p)) 

shows up. Of course, this exact expression has no prac- 
tical use, again because there is no hope of empiri- 
cally measuring any many-points probabilities of extreme 
events with a meaningfull signal-to-noise ratio. We rather 
want to stress that non-linear correlations and multi- 
points dependences are relevant, and that a character- 
ization of clustering based on the autocorrelation of re- 
currence intervals is an oversimplified view of reality. 



r 



The exact universality of the mean recurrence inter- 
val imposes a natural scale in the system. A scaling re- 
lation in the distribution of such recurrences is only pos- 
sible in absence of any other characteristic time. When 
such additional characteristic times are present (typically 
in the non-linear correlations), the rescaling must be per- 
formed with a scale-dependent rcnormalization, in the 
spirit of 



tt(t) 



(r) 



(8) 



where a T is defined in Eq. (|6]) and does contain infor- 
mation about the dependence structure. Then in some 
cases where c T embeds all the dependence of the under- 
lying process, the scaling ([SJ might hold, at least ap- 
proximately, even in the presence of several scales, as the 
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recurrence times corresponding to the different regimes 
would be renormalized accordingly. 

We stress that recurrences are intrinsically multi- 
points objects related to the non-linear dependences in 
the underlying time-series. As such, their autocorrela- 
tion is not a reliable measure of their dynamics, for their 
conditional occurence probability is much history depen- 
dent. We report elsewhere Q more properties of recur- 
rence times and the statistics of other observablcs (wait- 
ing times, cluster sizes, records, aftershocks) in light of 
their decription in terms of the diagonal copula. We hope 
that these studies can shed light on the n-points prop- 
erties of the process by assessing the statistics of simple 
variables rather than positing an a priori 2-points corre- 
lation structure and deriving a corresponding recurrence 
times distribution. 
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(a)IBM daily stock returns. 
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